Optimal Cloning of Pure States 
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Abstract. We construct the unique optimal quantum device for turning 
a finite number of <i-level quantum systems in the same unknown pure 
state a into M systems of the same kind, in an approximation of the 
2 ■ M-fold tensor product of the state a. 
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1. Introduction 



One of the fundamental features distinguishing quantum theory from classical theories is 
epitomized by the "No Cloning Theorem" [WZ]. The "quantum copiers" forbidden by 
this theorem, in much the same way as perpetual motion machines are forbidden by the 
Second Law of thermodynamics, are defined as follows: a copier takes one quantum system 
as input and produces as output two systems of the same kind. If one now runs experiments 
in which each input is prepared according to the same density matrix, either one of the 
outputs is discarded, and some measurement is then performed on the remaining output, 
one should get the same statistical results as measured directly on the inputs, for arbitrary 
initial preparations and final measurements. 

The impossibility of cloning machines is intimately connected to other impossible tasks 
of quantum theory, notably "joint measurement" and "teleportation" . It is well-known that 
there are some pairs of quantum observables (such as different spin components) which 
cannot be measured jointly on the same device. This statement implies the No-Cloning 
Theorem, since a quantum copier could be operated as a universal joint measuring device: 
one simply applies the two measuring devices in question to the two outputs of the copier. 
Hence a copier is a more powerful machine than a joint measuring device. On the other 
hand, there is a hypothetical machine even stronger that the copier: the "teleporter" , 
which is hence also forbidden by the No-Cloning Theorem. By definition, "teleportation", 
or "classical teleportation" to avoid confusion with the fundamental process of entangle- 
ment enhanced teleportation [BB + ], is the transmission of quantum states (or "quantum 
information") on classical channels. A teleporting device would consist of a measuring 
apparatus, which produces some classical output (a measuring result) from a quantum in- 
put, and a reconstruction apparatus, which prepares quantum systems, taking the results 
of the previous measurements into account. The criterion for successful teleportation is 
again the impossibility of distinguishing the outputs of the overall device from the inputs 
by statistical experiments. To make a copier from a teleporter would be easy: one simply 
has to make copies of the intermediate classical measuring results (which is a trivial oper- 
ation for classical data) and to run the reconstruction apparatus on each of these copies. 
Hence classical teleportation is also impossible. 

However, the impossibility of all these devices cannot be the end of the story. For 
example, while the No- Teleportation Theorem declares it impossible to determine a quan- 
tum state by the classical data obtained in a single measurement, it is clearly possible 
to determine quantum states by a run of statistical experiments. In fact, according to 
the statistical interpretation, a quantum state is nothing but a mathematical encoding of 
all data which can be collected in this way. Therefore, it must be possible to construct 
devices which take several identically prepared quantum systems as an input, make a mea- 
surement, and thereby determine the density matrix describing the preparation to any 
desired degree of accuracy. This is the problem of quantum state estimation, which has 
been studied by many authors [Hol,Hel,MP,DBE] . Of course, we can use this classical 
information to prepare many new systems ("clones") in a state which is a close approxi- 
mation of the input state. Clearly, the quality of the clones will depend on the number of 
initially available input systems. On the other hand, there will be no limit to the number 
of clones obtainable in this way, because the classical measuring result can be copied and 
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used arbitrarily often. 

More recently, there has been a new twist to this problem coming from the observation 
that if only a given number of clones is needed, the procedure via a classical intermediate 
stage is too wasteful [GM]. Indeed it has been shown [HB] that there is, in general, 
a tradeoff between the number of clones and their quality. Clearly, the optimal cloning 
machine giving a fixed number of copies from a fixed number of identically prepared systems 
cannot operate via an intermediate classical stage: It has to stay entirely in the quantum 
world. This paper is is a contribution to the theory of such optimal cloning machines. 

There are several variations of the optimal cloning problem, which are perhaps best 
described in the form of a game. Fixed parameters in this game are the Hilbert space 7i 
describing the type of systems making up the inputs as well as the outputs to the cloning 
device. Its (always finite) dimension will be denoted by d = dimH. Most work so far has 
been done on the "qubit case" d = 2. Also fixed will be the number iV of input systems 
and the number M of output systems. The game is played by two physicists called Alice 
and Clare. (If the paradigmatical eavesdropper is Eve, why should the paradigmatical 
doner not be Clare?). Alice's first step is to choose a preparation for quantum systems 
with Hilbert space H, as described by a density matrix a, say. She then proceeds to 
run her preparing procedure N times, thus producing a composite system in the Hilbert 
space Ti. <g> • • ■ <S> T~L = T[® N (a tensor product of N copies) in the state a® N , and sends 
the prepared particles to Clare. Clare's move is to run a cloning device T of her choice, 
making M > N systems out of the given N systems. (The mathematical objects qualifying 
as "devices" in this context will be defined in the next section) . The next step is to compare 
Clare's M output systems with the state (j® M , which Alice supplies by running her original 
preparation M times. Clare scores in this game, whenever her output is sufficiently similar 
to a® M . There are different "figures of merit" on which Clare's scores might be based, 
resulting in different versions of the cloning game, and, possibly, in different "optimal" 
cloning devices T. Some of the simplest are based on a simple extension of the game: 
We allow Alice a further move, challenging the quality of Clare's clones, by choosing 
some observable. The two then each measure the expectation value of this observable on 
their respective M-particle states, and the overall score is based on the difference of these 
expectation values. 

Apart from the fine points of the comparison, two basic choices have to be made in the 
rules of this game, leading to four different versions of the game. The first choice concerns 
constraints on the initial preparation a done by Alice. For the discussion of eavesdropping 
on quantum cryptography channels it is often of interest to allow only a small number of 
states (e.g., two) [GH]. Orthogonal states can obviously be cloned perfectly, and so can 
some sets of non-orthogonal ones [DG]. However, we are interested in so-called universal 
cloning machines [BDEMS], which work on generic (and unknown) inputs. Still there 
is a choice to be made, namely whether or not Alice is required to prepare a pure state 
a = \ip){ip\ given by a wave vector ip. Here the present paper follows most of the current 
literature by imposing purity. The reason is mostly that the full mixed-state problem 
seems to be a lot more involved, even in the classical case, and it seems wise to gain a full 
understanding of the simpler case first. 

The second choice to be made in the rules is whether Alice really challenges Clare's 



3 



full M-particle output state, or just one clone at a time. That is, we could constrain 
her to challenge Clare's result by selecting only one of the M clones, and demanding a 
one particle observable of her choice to be measured on it. This constraint on Alice is in 
keeping with the definition of the quantum copier, which also imposes only conditions on 
one output at a time. This "one particle tesf version of the cloning problem has been 
considered in the qubit case in several recent papers [GM,DBE]. It turns out, however, 
that it is the more difficult problem for d > 2. Therefore, in this paper we will give a full 
analysis of the "pure state - many particle test" cloning problem for arbitrary d. 

The "pure state - one particle test" version is settled in the d = 2 case [GM] , where the 
representation theory of SU2 makes a full analysis relatively simple. The optimal cloning 
device found by Gisin and Massar is the same as the one found in the present paper for the 
many particle test version. Work on the case of general d is under way in Braunschweig 
with Michael Keyl. The optimal cloning devices fit perfectly into the framework for the 
classical limit (in this case, of SU^-spin systems), set up in [Wer,WW,Men]. In this 
way, a precise meaning can be given to the intuition that cloning very (infinitely) many 
copies is equivalent to the cloning procedure via classical measurement, and subsequent 
preparation. This connection, which requires the explanation of more formalism than this 
paper can take, will be explored in a later paper. 

2. Statement of the problem 

In order to state the optimal cloning problem precisely we must first state what the admis- 
sible "quantum devices" are among which we are looking for an optimal one. There are 
two ways of approaching this problem, which are fortunately equivalent: In either case, 
each device is characterized by its action on quantum states. Thus if the input systems 
are described in a Hilbert space 7i, and consequently the input states are density matri- 
ces on H, and the output systems are described in a Hilbert space /C, a quantum device 
is given by a map T taking density matrices over Ti into density matrices over /C. The 
first approach to characterizing the admissible maps T is the axiomatic one: a minimal 
requirement for T to be consistent with the statistical interpretation of quantum theory 
is that T must respect convex combinations (incoherent mixtures) of states. This allows 
the extension of T to a linear operator from the space of trace class operators over H into 
the trace class operators over /C. This linear operator has to take positive elements into 
positive elements, which is usually expressed by calling T a positive (super-)operator. If 
this condition remains valid if T is applied only to a part of a larger system, T is called 
completely positive. Since T takes density matrices into density matrices, it also has to 
respect normalization (i.e., the trace). Therefore, according to the axiomatic definition, an 
admissible machine must be given by a completely positive trace preserving linear operator 
T. The second definition of "admissible devices" is constructive. It allows only operations 
which can be done by first coupling the given system to an auxiliary one (often called 
the "ancilla"), then making the two interact, as described by a unitary transformation, 
and finally restricting to a suitable subsystem of the combined system by taking a partial 
trace over the ancilla, and perhaps further subsystems. Each of these steps is a completely 
positive trace preserving operation, so clearly every quantum device admissible in the con- 
structive approach is also admissible by the axiomatic approach. But the converse is also 
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true (by virtue of the Stinespring dilation theorem [Sti]): every linear completely positive 
trace preserving map can be constructed in the way described. 

Let us now turn to the description of figures of merit for quantum cloning devices, i.e., 
on quantitative ways of expressing the "closeness" between the output T(a® N ) of Clare's 
cloning device and the state a® M , which the non-existent ideal doner would achieve. This 
question has to be treated rather carefully for the mixed state versions of the cloning game. 
Possible candidates here are the trace norm difference ^(cr®^) — <7® M || , or perhaps an- 
other p-norm [RS] like the Hilbert Schmidt norm, or the relative entropy S(T(a® N ), a® M ) 
[OP]. In principle, the optimal doner might depend on the figure of (de-)merit chosen. 
However, in the pure state case they all lead to the same optimum. In this paper we will 
use an even simpler figure of merit, which makes sense only in the pure case, namely the 
fidelity tr [a® )M T '(cr 8 )N )), which would be 1 for the non-existent ideal doner. Good cloning 
means to bring this quantity as close to 1 as possible, for all input states a. The worst 
result 



is taken as the figure of merit. So Clare's and our problem is to maximize J-(T) by a 
judicious choice of T, given H, N and M. The optimum will be denoted by T = sup T J-(T), 
and depends on the three integers d = dim7Y, N, and M. 

We note in passing that so far we have only considered the problem of minimizing the 
worst case losses for Clare in a game of the type described. It would be interesting to take 
the game theoretic description more seriously, and to ask for the equilibrium points of the 
variants of this game in the sense of von Neumann's theory of two-person games [vNe]. 



In this section we will define the cloning maps, which will be shown to be the unique 
optimal ones in the next section. Since we are considering only pure input states a® N , it 
suffices to consider the action of T on such states and their linear combinations. These 
will be operators on the span of the vectors of the form <p <g> ■ • ■ <g) <p = <p® N e Ti® N . Our 
first task is to collect some of the basic properties of this space. 

The span of the tensor powers (p® N can be described very easily: it is precisely the 
space of vectors which are invariant under all permutations, or the "Bose subspace" of 
Ti® N in physical terminology. We will denote it by Ti® N . A convenient basis in this space 
is the "occupation number basis" canonically associated to some basis in the one-particle 
space Ti. It is labelled by tuples (ni, . . . ,rid) with J2 k Uk = N. A generating function 
for this basis is given by the tensor power vectors (p® N , the variables in the generating 
function being the components cp±, . . . , <p>d of <p> in the given basis of Ti. Explicitly, 




(2.1) 



3. Description of the optimal cloning machines 



m,...,n d K=l v K ' 

It is easily checked, using this basis that the dimension of Ti® N is 




(3.1) 




(3.2) 
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where d = dimH. We will denote by sn the orthogonal projection of H® N onto Hf N . 
A crucial feature of the symmetric subspace is that the unitary operators U® N leave it 
invariant, and act on it irreducibly. That is to say, any operator A supported by 1~C® N 
(A = Asn = sn A), which commutes with all U® N , must be a multiple of sn, i-e., of the 
identity operator on Hf N . 

The optimal cloning map has to take density operators on H® N to operators on 
7i® M . An easy way to achieve such a transformation is to tensor the given operator p 
with the identity operators belonging to tensor factors (N + 1) through M, i.e., to take 
p i— > p®l®^ M ~ N ' . This breaks the symmetry between the clones, making N perfect copies 
and (N — M) states, which are worst possible "copies" . Moreover, it does not map to 
states on the Bose sector Hf M , which would certainly be desirable, as the target states 
a® M are supported by that subspace. An easy way to remedy both defects is to compress 
the operator to the symmetric subspace with the projection sm- With the appropriate 
normalization this is our definition of the cloning map, later shown to be optimal: 

n P ) = ^ ] SM(p®l® iM - N) )s M . (3.3) 

Complete positivity is obvious from the form of T. So in order to verify that this is a 
legitimate cloning map, we only have to check that the normalization factor is chosen 
correctly to make T trace-preserving. To begin with, trT(p) is a linear functional of p, 
and can hence be written as tr(pX), for a suitable positive operator X on Hf N . From the 

covariance of T, i.e., the property 

t(u® n P U*® n ^) = U® M T(p)U*® M , (3.4) 

one concludes that X commutes with JJ® N and, by irreducibility, X must be a multiple of 
the identity. It remains to be shown that this multiple is 1 or, equivalently, that the trace 
of some density matrix is preserved by T. To this end we consider the maximally mixed 
density matrix tn = d[A^] _1 SAr on H® N , which is also characterized as the unique density 
matrix on Hf N invariant under sitewise rotations p h- > JJ® N pU*® N . Then T(rjv) = 

d[M]- 1 s M (s N <S> l M ~ N )s M = d[M]~ 1 s M = r M - Hence f as defined in (3.3) is trace 
preserving. 

The value of F(T) is determined by observing that, for a pure state a on H, a® M is a 
one-dimensional projection, which is smaller than both sm and {o® N ® j®{M-N)y jj ence 

^ d ^] d W f ,«M. _ d[N] ,„ ^ 

nT) ~W\ ( ( )sm) ~W\ ( } ~d[M] ■ (3 - 5) 

We conclude this section by computing the performance of T with respect to the one- 
particle-test version of the cloning problem. Some of our considerations will be valid for any 
cloning map T (not necessarily T = T), which maps density matrices on Hf N into density 
matrices on Hf M , and satisfies the covariance condition (3.4). For any density matrix p on 
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H® N , we denote by R(p) its one-site restriction denned by tr(R(p)A) = tr(p(,4®I® (JV_1) )). 
Consider the one-site restriction R(T(a® N )). By covariance of T, this must be a density 
matrix on the one-site Hilbert space 7i, commuting with all unitaries U, which commute 
with a. Hence we can write it as 



R(T(a® N ))= 1 (T)a + (l- 1 (T))r 1 , (3.6) 



where T\ = <i -1 l is the totally mixed density matrix on Ti. By covariance of T, the number 
7(T) does not depend on a, and is called the Black Cow factor* of T. Surprisingly, it 
is useful also for the discussion of "cloning in stages" from N to M to R systems, even 
though in the second stage the doner from M to R systems no longer finds a product 
density matrix a® M . In fact, on the right hand side of (3.6) we can write R{a® N ) for a, 
and it is clear that (3.6) becomes 



R(T(p)) = 1 (T)R(p) + (l- 1 (T))tT(p)r 1 , (3.7) 



for all p in the linear span of the operators cr® . But these are all density matrices on 
7if N : this can be seen by inserting the expansion (3.1) into a® N = \<p® N )(<p® N \, and 
observing that from the resulting power series in ip K and TpH the coefficients \n)(m\ can be 
extracted. Hence (3.7) holds for all cloning maps T satisfying the assumptions stated at 
the beginning of this paragraph. 

As a corollary we obtain the equation ^{TrmTmn) = i{Trm)i{Tmn), for cloning 
in stages. Since the family of optimal doners defined by (3.3) obviously satisfies the 
concatenation property TrmTmn = Trn, we find that the Black Cow Factor for these 
must be of the form ^(Tmn) = 1n/im- 

To compute ^(Tmn) for (3.3), we use the normalization property of T in the form 
tr(%<7^ ® ^(M-AO) = d[M]/d[N], for any pure a. Then, on the one hand, we find that 



tr(a J Rf(a® Ar ))= 7 (f) + (l- 7 (f))/rf 



* The reason for this terminology is that it plays an important role in discussions of the 
cloning problem started by Chiara Machiavello and Artur Ekert at the Black Cow Cafe in 
Croton-on-Hudson, NY, and further clarified in collaboration with Dagmar Brufi [BEM] . 
I learned about this line of argument from a set of "Black Cow Notes" by Nicolas Gisin 
and Sandu Popescu. 
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and, on the other hand, 



tr(aRT (<r® N )) = tr((<7 f*W-»)f (<r® N )) 

= ^E tr (* < * , *V w )) 



] ^^ tr ( ff W SM (^ 8 I«(«--)) Sjl ) 

<W 4lM + ( M-^). d t M ] 



M d[M] \ d[iV] v y d[N + 1] 

N M-NN+1 
~M + M d + N ' 

where in the second line we used the abbreviation for the tensor product of M oper- 
ators, all of which are I, except the /c th , which is a. At the fourth equality we used that 
the sum ^2 k cr^ commutes with permutations and hence with sm- Solving for 7(T) we 
find the Black Cow Factor of (3.3) to be 

7 (f)= N d + M . (3.8) 
n 1 d + N M y 1 

This is a quotient, as expected. Specializing to d = 2 we find this result also in agreement 
with the value found in [BEM] by combining the Black Cow concatenation argument 
with the previously determined optimal value for state determinations. Again this is to be 
expected, because the optimal doner found in the one-particle-test version of the problem 
(for d = 2) agrees with (3.3). We conjecture that the equality of the optimal solutions to 
the one-particle-test and many-particle-test versions of the cloning problem coincide also 
for d > 2. 
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4. Proof of optimality 

In this section we will prove the optimality of the cloning map T, defined in (3.3), with 
respect to the figure of merit T from (2.1). Let 

F=supF(T) (4.1) 

T 

be the best bound for JF(T) . Since T is an infimum of continuous functions, it is an upper 
semicontinuous function, and since the set of admissible T is compact (bounded and closed 
in a finite dimensional vector space), the supremum (4.1) is attained, i.e., optimal doners 
with JF(T) = T do exist. 

For a pure state cr, rotated by unitary U on H, we will write o\j = UaU*. The average 
of any cloning map with respect to sitewise rotations will be denoted by 

T(p) = JdU u*® M T(u® N pU*® N ^)U® M , (4.2) 

where u dU n denotes the integration with respect to the normalized Haar measure of the 
unitary group of H. Then T is again an admissible cloning map, and T = T if and only if 
T satisfies the covariance condition (3.4). 

Theorem. For any cloning map from N to M systems, 

F(T)<d[N]/d[M] , 
with equality if and only ifT = T. 

Proof : Let T be an optimal cloning device, i.e., JF(T) = T . Then, for every pure a, we 
have 

t T (<T® M T(<T® N )) = JdU tr(a® M T(a® N )) > JdU F(T) = T . 

Since the left hand side is independent of a, it is also equal to F{T), hence F{T) > T. 
By definition of T we also have T > F{T), i.e., F(T) = T . Hence the integral over the 
positive quantities T — tr(cx® M T(cr® iV )) vanishes, which implies that 

tr( ( T® M T((7® JV )) =T{T)=T 

for all a. 

Next consider the rotation invariant density matrix = d[N]~ 1 SN on the symmetric 
subspace. Since T commutes with rotations, tn has to be mapped into a likewise rotation 
invariant density matrix on T~i® M . In particular, because the representation U® M restricted 
to the symmetric subspace is irreducible, we must have 

where "Rest" stands for a density matrix orthogonal to %, and < A < 1. We now use 
that T(sn — cr® N ) must be a positive operator. Taking its trace with a® M we thus find 
that 

0<tr(^T( Sjv -^))=A^l-^ . (*) 
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Hence T < Ad[iV]/d[M] < d[N]/d[M]. Since we have already seen in (3.5) that F(T) = 
d[iV]/d[M], we have shown that T is equal to this value, and T is indeed optimal. 

It remains to be shown that T is the only cloning map achieving this value. From the 
last string of inequalities we see that for any optimal doner we must have A = 1. This 
is equivalent to saying that T(a® N ) is supported by the symmetric subspace for all a 
and, since T is an integral over rotated copies of T, the same conclusion also holds for T. 
Moreover, for an optimal doner T, the right hand side of (*) has to vanish. This is again 
an integral with respect to dU over a positive function, which hence has to vanish, too: 

tr(<j® M T{s N -<j® N )) =0 . (**) 

Since the second term in this expression was already shown to be equal to T for all a, 
we conclude that tr(<7® M T(sjv)) = T for all a. The operators a® M span the space of 
operators on Hf M . Hence this equation is equivalent to T(sn) = Fsm- 

To further exploit the optimality condition, we introduce the Stinespring dilation [Stijof 
T in the form 

T(p) = FV*{p®l K )V , 

where V : 7if M — > Hf N <E> K, for some auxiliary Hilbert space /C, and p is an arbitrary 
density matrix on 7if N . We have included the factor T in this definition, so that for an 
optimal doner V*V = I. The optimality condition(**), written in terms of V becomes 

(tp® M , V*((s N - a® N ) ® 1 K )V^ M ) = \\{{s N - a® N ) ® l K )V<p* M \\ 2 = , 
where a is the one-dimensional projection to cp e 7i. Equivalently, ((sjv — a® N ) <S> 
l;c)V(p® M = 0, which is to say that V<p® M must be in the subspace tp® N ® /C for ev- 
ery ip. 

So we can write Vip® M = tp® N <S>£(<p), with e /C some vector depending in a generally 
non-linear way on the unit vector <p> e T~L. From the above observation that V must be an 
isometry we can calculate the scalar products of all the vectors 

(<p, tP) M = (<p® M , iP® M ) = (V<p® M , V^ M ) 

i.e., (t(<p),m)K = (<p,i>) M - N = {^ M - N ,r M - N ) . 

This information is sufficient to compute all matrix elements (ipf M , T(\(pf N ) (<pf N \)^f M ), 
i.e., T is uniquely determined and equal to T. 
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